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Abstract. In this paper we introduce a neural network model of self-organization. 
This model uses a variation of Hebb rule for updating its synaptic weights, and surely 
converges to the equilibrium status. The key point of the convergence is the update rule 
that constrains the total synaptic weight and this seems to make the model stable. We 
investigate the role of the constraint and show that it is the constraint that makes the 
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model stable. For analyzing this setting, we propose a simple probabilistic game that 

00 ' 

models the neural network and the self-organization process. Then, we investigate the 
-g I characteristics of this game, namely, the probability that the game becomes stable and 
the number of the steps it takes. 

1. Introduction 

How does the brain establish connections between neurons? This question has been 
one of the important issues in Neuroscience, and theoretical researchers have proposed 
various models for self-organization mechanisms of the brain. In many of these models, 
competitive learning, or more specifically, competitive variants of Hebb's rule have been 
used as a key principle. In this paper, we study one property of such competitive Hebb 
rules. 

As one typical example of self-organization, "orientation selectivity" [|WH63|1 has been 



studied intensively. In the primary visual cortex (areal7) of cats, there is some group of 
neurons that strongly reacts to the presentation of light bars of a certain orientation, 
which we call orientation selectivity. An interesting point is that in a very early stage 
after birth, every neuron reacts to all bars of every orientation. This indicates that 
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orientation selectivity is obtained after birth; that is, each neuron selects one preferred 
orientation among all orientations. To explain the development of orientation selectivity, 
a considerable number of mathematical models have been investigated; see, e.g., ||Swi96|| . 
Although these models may look quite different, most of them use, as a principal rule for 
modifying synaptic strength, a competitive variant of Hebb rule, which is essentially the 
same as the rule proposed in the pioneer paper of von der Malsburg |Mal73| | , the paper 
that first gave a mathematical model for the development of orientation selectivity. 

A Hebb rule is a simple rule for updating, e.g., the weight of connection between two 
neurons. The rule just says that the connection between two neurons is strengthened 
if they both become active simultaneously. This rule has been used widely for neural 
network learning. Von der Malsburg constrained this updating rule so that the total 
connection weight of one neuron are kept under some bound. In this paper, we call this 
variation of Hebb rule a constrained Hebb rule. He showed through computer experiments 
that orientation selectivity is surely developed with his constrained Hebb rule. 

Since the work of von der Malsburg, many models have been proposed, and some have 



been theoretically analyzed in depth; see, e.g., [ Tan90 ] . For example, a feature of various 



constrained Hebb rules as a learning mechanism has been discussed in ||MM94|| . Yet, the 
question of why orientation selectivity is obtained by following a constrained Hebb rule 
has not been addressed. Note that the development of orientation selectivity is different 
from ordinary learning in the sense that a neuron (or, a group of neurons) establishes 
a preference to one particular orientation from given (more or less) uniformly random 
orientation stimuli. In this paper, we discuss why and how one feature from equally good 
features is selected with a constrained Hebb rule. 

In order to simplify our analysis, we propose a simple probabilistic game called "mo- 
nopolist game" for abstracting Hebb rules. In monopolist game, an updating rule cor- 
responds to game's rule, and the selectivity is interpreted as that a single winner of a 
game — monopolist — emerges. Then we prove that a monopolist emerges with prob- 
ability one in games following a von der Malsburg type rule. On the other hand, we 
showed theoretical evidence supporting that (i) the chance of having a monopolist is low 
without any constraint, and (ii) a monopolist emerges even under a rule with a weaker 
constraint. These results indicates that the importance of constraint in Hebb rules (or, 
more generally, competition in learning) to select one feature from equally good features. 

We also analyzed how fast the monopolist emerges in games following a von der 
Malsburg type rule. This analysis can be used, in future, to estimate the convergence speed 
of constrained Hebb rules. (In this paper, most of the proofs are given in Appendix.) 
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2. Von der Malsburg's Model and Monopolist Game 

Here we first explain briefly the model considered by von der Malsburg. (Von der Malsburg 
studied the selectivity for a set of neurons, but here we only consider its basic component.) 

Neural Network Structure 

We consider two layer neural network. In particular, we discuss here the orientation 
selectivity for one neuron, and thus, we assume that there is only one output cell. On the 
other hand, the input layer consists of 19 input cells that are (supposed to be) arranged 
in a hexagon like the ones in Figure 1. We use % for indicating the ith input cell, and IN 
for the set of all input cells. 

Stimuli and Firing Rule 

We use 9 stimuli with different orientations (Figure 2), which are given to the network 
randomly. Here • indicates an input cell that gets input 1, and o indicates an input cell 
that gets input 0. 
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Figure 1: Neural network model. 
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Figure 2: Nine stimuli. 

We use Oj to denote input value (either or 1) to the ith input cell. Then output 
value V is computed as V = Th p (J2iew w i a i), where Wi is the current synaptic strength 
between the output cell and the ith input cell. Th p (x) is a threshold function that gives 
x — p if x > p and otherwise, where p is given as a parameter. 

Updating Rule 

Initially, each weight Wi is set to some random number between to some constant. Then 
weights are updated each time according to the following variation of Hebb's rule, which 
we call the constrained Hebb rule (of von der Malsburg). 
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w[ = Wi + c inc aiV, and Wi = w[ x Wq/ ^ w 'k 

\ kei 

Where c inc (which is called a growth rate) and Wq (total weight bound) are constants given 
as parameters. The first formula may be considered as the original Hebb's rule; on the 
other hand, the second one is introduced in order to keep the total weight within Wq. (In 
fact, it is kept as Wq.) 

With this setting, von der Malsburg demonstrated that the selectivity is developed 
through computer simulations. Thus, it seems likely that some selection occurs even 
from uniformly random examples, and that the constraint of the von der Malsburg's rule 
is a key for such a selection. In this paper we would like to study this feature of the 
constrained Hebb rule. For this, we further simplify von der Malsburg's computation 
model, and propose the following simple probabilistic game. 

Monopolist Game 

Basic Rule : Consider a finite number of players. Initially they are given the same 
amount of money. The game goes step by step, and at each step, one of the players 
wins with the same probability. The winner gets some amount of money, while the 
other loses some. 

Details : A player who loses all his money is said become bankrupt. Once a player 
becomes bankrupt, he cannot get any amount of money, though he can still win with 
the same probability. (See below for the motivation.) 

Goal : The game terminates if all but one player become bankrupt. If the survived 
player keeps enough money at that point, then he is called a monopolist. We call a 
situation where a monopolist appears monopoly. 

Notations. We use n and n' to denote the number of players and that of remaining (not 
being bankrupt) players, and use i, 1 < i < n, to denote players' indices. Throughout 
this paper, each player's wealth is simply called a weight, and let w,i denote the player i's 
current weight. Let / and Wq respectively denote the initial weight of each player and 
the total amount of initial weights; that is, Wq = nl. 

The connection of this game with von der Malsburg's computation model is clear; each 
player's weight corresponds to total synaptic strength between the output cell and a set 
of input cells corresponding to one type of stimulus, and the emergence of a monopolist 
means that the network develops preference to one orientation. From this correspondence, 
it is natural to require that even a bankrupt player can win with the same probability 
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1/n, which reflects the fact that the probability of a stimulus of each orientation appears 
is the same no matter how neural connections are organized. 

An updating rule of players' weights corresponds to a rule of changing synaptic 
strength in the network. Here we can state updating rules in the following way. (In 
the following, let io denote the player who wins at the current step.) 

] Wi + / inc - / dec , if % = i , and 
{ Wi- / dec , otherwise. 

Here / inc and /d ec are the amount of increment and decrement at each step respectively, 
and one type of monopolist game is specified by defining / inc and /dec- In the following, 
we assume that these values are determined from w iy w io , n, and n' . From the relation 
to von der Malsburg's computation model, we require that both / inc and /d ec are if 
Wi = 0; that is, once a player loses all money, he stays forever in the weight state. (In 
the following, we will omit stating this requirement explicitly.) 

Now we consider the rule that corresponds to the constrained Hebb rule of von der 
Malsburg's rule. It is defined as follows with constant c inc . 

/inc = Qno and /dec = C'mc/ n ■ (1) 

(Recall that n' is the number of currently remaining players.) 

Note that with this rule, the total amount of wealth is kept constant. Thus, in this 
sense, it corresponds to von der Malsburg's rule, and we call it constrained rule. Note that 
we may also consider a similar rule such that /i nc is not constant but proportional to Wj. 
(Similarly, /d cc is also proportional to Wi ) This rule might be closer to the original von der 
Malsburg's rule. This difference is, however, not essential for discussing the probability 
of having a monopolist, i.e., for our discussion in Section 3. On the other hand, there is a 
significant difference in convergence speed; but roughly speaking, the difference disappears 
if we take the log of weight. Thus, we will discuss with the above simpler rule. 

3. Importance of Competition 

Here we compare three different updating rules for monopolist game, and show that 
constraint is important to derive a monopolist. From this, we could infer that some sort 
of constraint, (or, competition in more general) is important in learning rules for selecting 
one among the others through random process. 

In the following, we consider the following three updating rules: (1) constrained rule, 
(2) local rule, and (3) semi local rule. Below we define these rules (except (1) that has 
been defined in the previous section) and discuss the probability P* that a monopolist 
emerges. 
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Constrained Rule 

We show that under constrained rule, P* is 1, that is, a monopolist emerges with proba- 
bility 1. 

A monopolist game in general expressed by an one-dimensional random walk. More 
precisely, for any i, we can express the player i's wealth Wi as the following random walk. 

w 



f dec f it 



Figure 3: One- dimensional random walk. 

Note that the particle (i.e., the weight moves to the left (resp., to the right) with 
probability 1 — 1/n (resp., 1/n). The left (resp., right) end of the interval means that 
the player i becomes bankrupt (resp., a monopolist). Thus, these two ends are absorbing 
walls. 

In a monopolist game under constrained rule with n = 2, we have /i nc = c- inc /2 and 
/dec = Cinc/2. Hence, the above random walk becomes standard one (see, e.g., |[Fel68| ), 
and it is well-known that the particle in such a standard random walk goes to one of the 
absorbing walls in finite steps with probability 1. This proves that P* = 1 when n = 2. 
Then by induction, we can prove P* = 1 when n > 2; thus, we have the following theorem. 

Theorem 3.1. Under constrained rule, a monopolist emerges (in finite steps) with prob- 
ability 1. 

Local Rule 

In constrained rule, for computing /dec, we need the number of remaining players; that 
is, weights cannot be updated locally. In general, in order to be competitive, an updating 
rule must not be local. Thus, to see the importance of competition, we consider here the 
following purely local updating rule. 

/inc Cine 5 and /dec Qiec- ( / 

Notice that for this local rule (and the next semi local rule), the notion of monopolist 
is less clear than constrained rule, because the notion of "enough amount of money" is 
not clear. Here we simply consider it as Wo/2, a half of the total initial weight. That is, 
we regard a single surviver as a monopolist if his weight is more than Wo/2; hence, P* 
is the probability that the game reaches to the state where > Wq/2 for some i\ and 
Wi = for the other i. 
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We first discuss one feature of this updating rule. In the following, let us fix Cd ec = 1- 
Our computer experiments show that the probability to have a single surviver (in a 
reasonable amount of steps) drops rapidly when c inc > n + 1. The reason is clear from 
the following fact. 

Theorem 3.2. Fix Cd ec to be one, and consider one player's weight. For any t, it increases, 

by t(-^- — 1) on average, after t steps. 
^ n ' 

Thus, if c inc > n, then it is quite likely that all players increase their weights, and 
thus no bankrupt appears in the game. On the other hand, if c; nc < n, then every player 
dies quickly, and hence, no monopolist occurs even though someone may become the last 
player. This means that the most crucial case is the CclSG C'hyq — 77/. Next we discuss P* for 
such a case. 

Recall that P* is the probability that, at some point in the game, all but one become 
bankrupt and that the survived player has weight > Wq/2. Since it is difficult to estimate 
P* directly, we analyze the following probability Pi instead of P*: Pi is the probability that 
at least one player's weight reaches to Wq/2 and no more two players have weight larger 
than sufficiently large value, say, kW for some k > 0. Notice that if a monopolist emerges 
at some point, then clearly, someone needs to reach Wq/2 in the game. Furthermore, it 
is unlikely that two players reach to kWo and one of them become bankrupt afterwards. 
Thus, we may regard Pi as an upper bound of P*. For this Pi, we have the following 
bound. 

Theorem 3.3. For any W\ and sufficiently large W 2 , we have 

For example, by taking W\ and W 2 as ^ and knl respectively, we have 

Pi < e -V(W+l) + e -/(n-l)/(fc/+l)n /A ._ e -2 „ (1 + l/jfe) e -V* _ 

which is less than 0.6 if k = 1. On the other hand, our computer experiments show that 
P* is less than 0.5 for various sets of parameters. 

Semi Local Rule 

As a third updating rule, we consider somewhat mixture of the above two rules. It keeps a 
certain amount of locality, but it still has some constraint. This rule is defined as follows. 

f hlc = min(c inc , W - J2 w i)' and /** = c dec- (3) 

j 
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That is, we want to keep the total weight smaller than W , where W is the total initial 
weight. Thus, a winner can gain o inc (in net, o mc — Cd ec ) if there is some room to increase 
its weight. In this case, only the winner needs to know the current total weight, or the 
amount of room to the limit W , and the other players can update its weight locally. 

Our computer experiments show that the probability P* that a monopolist emerges 
is fairly large if c inc is large enough, say c inc > 2n. On the other hand, P* gets small when 
c inc is small, which is explained in the same way as local rule. Although we have not been 
able to prove that P* is large for sufficiently large c inc , we can give some analytical result 
supporting it. 

Here instead of analyzing P*, we estimate (i) the average number of steps until all but 
one players become bankrupt, and (ii) the average number of steps until the total weight 
(which is initially Wo) becomes Wo/2. Let T n ^i and Pwo->VFo/2 denote the former and the 
latter numbers respectively. We prove below that T n ^i is smaller than T Wo ^ Wo / 2 if W 
is large enough. This means that it is likely that at the time when all but one players 
become bankrupt, the total weight, which is the same as the survivor's weight, is larger 
than Wo/2, that is, the surviver is a monopolist. 

Theorem 3.4. Fix again c dec to be one. If / > (m6)n(n — 2) and c inc > 2n, then we 
have T Wo ^ Wo/2 > 

4. Efficiency Analysis 

In this section we discuss how fast a monopolist emerges in games with constrained 
rule. We estimate an upper bound on the average number of steps needed for monopoly 
to emerge, and we give some justification (not a rigorous proof) supporting that it is 
0{n\ln/c hlc f). 

We start with some definitions and notations that are used through the section. Here 
we modify our monopolist game and define a variant of monopolist game. Let game 
denote the original monopolist game. We will denote by game x a variant of game in 
which no bankrupt player can win. Thus, in game 1; the winning probability of remaining 
players is 1/n' instead of \/n. As we will see game x is useful for induction and it is easier 
to analyze. 

These two game types are defined on different probability spaces. Let us define them 
more precisely. For all two game types, (the execution of) a game is specified by a 
game sequence, i.e., a string from {1, . . . , n}* that defines a history of winners. (Precisely 
speaking, we also need to consider infinite strings; but as we see below, we may ignore 
infinite strings.) We say that a game sequence x kills a player % if Wi becomes (or, 
negative) in the game following x just after the end of x, and we say that x derives a 
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monopolist if the second last player is killed and monopoly emerges just after x. We say 
that a game sequence x is valid (resp., strongly valid) if it derives a monopolist and no 
prefix of it derives a monopolist (resp., x contains no indices of previously killed players). 
Note that the meaning of these notions may vary depending on game types. Now for any 
n, let X n (resp., Y n ) be the set of game sequences for n player games that are strongly 
valid w.r.t. game (resp., valid w.r.t. game-^. For each x in X n , its probability Pr{a;} is 
On the other hand, the probability Pr{y} of y E Y n depends on the number of 
remaining players, and it is rather complicated. (We omit specifying Pr{y} because it 
is not important for our discussion.) Note that X n and Y n are all prefix free. Also it is 
not hard to show that Pr{X n } and Pr{Y„} are one. (For example, Pr{X ra } = 1 follows 



from Theorem |3.1| .) Therefore, we may regard X n and Y n as a probability space of the 
corresponding game, and we do not have to worry about infinite strings. 

We denote by T(n, ii, . . . , I n ) (resp., Ti(n, Ii, . . . , I n )) the number of steps needed un- 
til monopoly emerges in game (resp, game x ) with n players and initial weight Ji, . . . , I n . 
When all the weights are equal, we use the simpler notation T(n,I). Our goal is to 
get some upper bound on E[T(n, I)]. But instead, we will analyze an upper bound on 
E[7\(n, I)], which gives us an upper bound on E[T(n, /)], as the following lemma guaran- 
tees. (The proof is intuitively clear and it is omitted in this abstract.) 

Lemma 4.1. There exists c\ such that for any sufficiently large n and any /, we have 
E[T(n,/)] < cinEpZifaJ)]. 

Now we analyze the convergence speed of game x . For our analysis, we split a game 
execution into stages where each stage is a part of the game until some amount of play- 
ers become bankrupt. More specifically, we denote by ti(n, Ix, . . . , I n ) the number of 
steps needed in a game with n players and initial weights I\,...,I n until at least 1 
player becomes bankrupt. The following lemma relates the two terms Ti(n,Ii, . . . ,I n ) 
and ti(n, I t , . . .,/„). 

Lemma 4.2. For any n and 1%, ■ ■ ■ ,I n , there exists a constant C2, > 1, and weights 
I' n _ C2 such that the following inequality holds. 

E[7\(n, /!,...,/„)] < E[h (n, A,..., !„)] + E[7\(n - c 2 , l' n _J}. 

By this lemma we can use induction for bounding the expected value of T\. Recall that 
when analyzing t 1 (n, Ji, . . . , J n ), by the way it is defined, no player becomes bankrupt, 
and thus, the amount of decrement is fixed to c- mc /n. Thus, game x until at least one player 
becomes bankrupt is regarded as a n-dimensional random walk, which is much easier to 
analyze. In fact, we can use the following lemma. 
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Lemma 4.3. Let X be a random variable that is 1 with probability 1/n and with 
probability 1 — 1/n, and let S = X\ + . . . + X t , the sum of the outcomes of t random trials 
of X. Then, for some constant a > 0, the following holds for any t and n. 




Now we are now ready to make the following claimj] 
Claim. 

E[T(n,/)] = o(n*(— ) 



2\ 



Justification. We start with estimating E[ti(n, Jx, . . . , I n )} by using the above lemma. 
For a given t and for any i, Let t« be the number of times that player i wins within t 
steps. Then Wi, the weight of player i, in game x is expressed as follows. 

( 1 

\ n 

(For simplifying our notation, we use c to denote o mc in the following.) 

Moreover, since game x until at least one player becomes bankrupt is regarded as a 
n-dimensional random walk, we can use Lemma |4.3| to show that the following event 
happens with probability bigger than 1/3. 

Wi = Ii + c(u--\ < L + cl a\ ] = L-ca\ -. 



V nj \n \ n n J \ n 

Therefore, with probability more than 1/3, the weight of player % becomes zero or 
negative if ca^Jt/n > Ii, that is, t > (Ii/ca) 2 n. Now sort players by their initial weights, 
and define P to be the set of the first (i.e., the smallest) n/2 players. Since the total 
weight is W (at any step), all players in P have weight at most 2W /n and therefore, 

flW \ 2 1 

Pr{ Wi < in t = n ( ) steps | i G P } > -. 

V nca J 3 

Moreover, if we can assume that each player in P become bankrupt independently, 
we also have the following probability: 

/2 \ n//2 

Pr{ There exists % G P, such that Wi < in t steps } > 1 — ( - 



1 We do not have a rigorous proof for this result, and for this reason we stated it as a claim. 
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From this observation, it is reasonable to bound E[t 2 (n, J 1; . . . , I n )} by c 3 t for some 
constant c 3 since for most of the valid game sequences (a 1 — (2/3) n ^ 2 fraction of them) 
this bound holds. 

Now combining the above lemmas and the obtained bound, we have 

EpKn,/)] < E[t! (n, h, . . . , Q] + E[T 1 (n- c 2 , ,I' n _J] 

< +E[Ti(n -02, /{,..., i;_ C2 )] 



2 / OTI/ \ 2 



2WoV / 21f 



< c 3 n — ^ +c 3 (n-l) ? + E[T!(n- 4, 

V nca / V [n — l)ca I 2 



(W Q \ 2 fln\ 2 
< c 3 n I — I < c 4 n I — I , tor some constant c 4 . 



From this and Lemma 4.1, we obtain the desired bound. □ 
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Appendix 

Here we state the proofs of lemmas and theorems not stated in the body of the paper. 
Section 3 



We begin with Theorem |3.2|. To prove it, we first estimate, the probability Piw that 



one player, say player 1, whose initial weight is / obtains weight W at some point in the 



game. By using results in random work [[Fel68|l , we can easily analyze this probability and 
obtain the following bounds. 

Fact 1. For any / and W, we have — < Pj w < 777- 

W + n W 



Theorem |3.2| Fix Cd ec to be one, and consider one player's weight. For any t, it increases, 

by — 1) on average, after t steps. 

v n ' 

Proof. Consider Pi with W\ and sufficiently large W%. Note first that 

Pr{less than two players' weight reach to W 2 } = (1 — Piy/^) n + nP^w 2 {l — Pi,w 2 ) n l 
Prjno player's wealth reaches to W{\ = (1 — Pj\ 

Then by definition of Pi, we have 



,Wi) ■ 



< 
□ 



Pi — (1 ~ Piy/^T + ^-P/,w 2 (l — Pi,w 2 ) n ~ (1 — Pi,Wi) n 

' Til / ^ I \ n-1 / I \n 



w 2 + n; w 2 v w 2 + n; v w x 



Next we prove Theorem |3.4j . In the following, we only consider games with semi local 
rule specified in the theorem; that is, the game starts with n players, the bound is the 
same as the total initial weight Wo = nl, and o inc > 2n. 

Below, for a given k < n, we consider the situation that k players are left, and analyze 
their total weight and the weight of the poorest player. For this, we use random walks 
representing respectively, the total weight, a random walk for the total, and the weight 
of currently the poorest player, a random walk for the poorest. These random walks are 
expressed as follows. 
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Figure 4: Random walk for the total (above) and for the poorest (below) 



Since the total weight is at most Wq, the poorest player cannot have more than Wo/k. 
Also as soon as the poorest player obtains Wo/k, he cannot be the poorest, and he (or, 
his role) is replaced with some currently poorest player, whose weight is again less than 
Wo/k. In any case, if we only consider the weight of the poorest player, we can assume 
that the rightmost end is a reflecting wall. On the other hand, by definition of semi local 
rule, the rightmost end of a random walk for the total is a reflecting wall. Thus, in both 
random walks, the rightmost walls are reflecting walls; hence, the particles are eventually 
absorbed in the leftmost walls. Here we discuss the difference between the number of 
steps until the particles are absorbed. 



Theorem |3.4| Fix again Cd ec to be one. If I > (ln6)n(n — 2) and c- mc > 2n, then we 



have T Wo ^ Wo / 2 > T n ^\- 

Proof. We first give an upper bound to T n -*i, the average number of steps until all but 
one players become bankrupt. For any k < n, consider the situation in the game that k 
players are left, and let TP(k) be the average number of steps that some player becomes 
bankrupt from this situation. 

By Lemma A.l below, we have 

T n ^ < £TP(A;) 

k=l 

< n ( e Wo/2n + e W /3n + _ _ _ + e W(« 2 )) < ne W /2n + ^ _ 2 ^W /3n < 2 ne Wo '\ 



where the last inequality holds from our assumption Wo > n(n — 2) In 6. 

Next consider T Wo _+ Wo / 2 , the average number of steps needed until the total weight 
becomes Wo/2 from Wo- Let TT(fc, x,y) be the average number of steps until the total 
weight becomes y from x under the condition that none of remaining k players become 
bankrupt. Note first that for any k and k! such that k < k', we have TT(k,x,y) < 
TT(k' ,x,y). Then, for any xo, ...,x n s, we have 

TT{n,Wo,x )+TT{n-l,x ,x 1 ) + --- + TT{2,x n - 3 ,W /2) > TT(2, W , Wo/2). 

Hence, T Wo ^ Wo/2 > TT(2, W , W /2). On the other hand, TT(2, W , W /2) is bounded 
in the following way by using Lemma A. 2 below and the assumption that c; nc > 2n. 

TT(2,Wo,Wo/2) > ((l+ ) 2 -l)(l- e - 



-2 



2 ^ ii 

> (e 5 ^T(l - e~ 2 )) > e W /2(n-2)_ 
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Comparing both bounds, we have T Wo _> Wo / 2 > T n ^ x if W > n(n — 2) In 6. □ 

Lemma A.l. Consider the situation in the game that k players are left, and let TP(A;) 
be the average number of steps that some player becomes bankrupt from this situation. 
Then we have TP (A;) < ne« k . 

Proof. Consider a random walk for the poorest. As explained above, we may assume 
that the random walk has a reflecting wall at W /k and an absorbing wall at 0. Here 
for showing an upper bound, we assume, as the worst case, that the initial weight of the 
poorest is W /k; that is, the random walk starts off at W /k. Then TP(/c) is bounded by 
the average number of steps until the particle reaches to the absorbing wall at 0. 

Let t x be the random variable denoting the number of steps starting off at x and 
arriving at x — 1 for the first time. Then we can evaluate t x by 



ta 



1, with probability 1 — 1/n, and 

1 + Wc inc -i + • • • + tx+i + t x , otherwise. 



Define e x to be the expectation of t x . Then we have 

n 1 

H T\ e x+c inc -i + . . . + e x +i, 



n — 1 n — 1 

Then by induction on x, we can show that ew /k-x < (1 + l/(n — l)) x . Thus, we have 
TP(fc) < E e w /k~ x < (^- 1 )( 1 + ^t) " ~ (n-l)e^. 



□ 



Lemma A. 2. For any x and y, x > y, we have 

n— 2// 2 ■. w °~ v / 2 \ ^o- 31 . 

Proof. Consider a random walk for the total (see Figure 4). By dividing all parameters 
by k = 2, we can modify it to essentially the same random walks as Figure 5. 

i _ t k W 

A n ri -r- 



x 



1 Cjnc 1 

T 



Figure 5: A modification of a random walk for the total 
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Note that this random walk is quite similar to a random walk for the poorest (see 
Figure 4 below). Thus, for e x , the average number of steps starting off at x and arriving 
at x — 1 for the first time, a similar argument derives that the following relation (here c 

Cinc/2)- 

2 

a x > H -(e^+c-i + • • • + e x+ i). 

n — 2 

Then it is not so hard to see that 

/ 2 \ x / 2 \x— c+i / 2 \i/ / 2 \ — c+i\ 

^ > (i + —J -(i + — 2 ) = (i + ^) (i - (i + ^) ). 

Now by using our assumption that c = c- mc /2 > n, we can derive the following bound. 

(Wo-tO/2-l 

v--\ / 2 \x / / z \ -c+l 

Tr(2,*,») > £ (l + JTTa) ( 1 -( 1 + ST2 

x=(W -x)/2 '* z ' 4 ^ 

n-2//. 2 

// - 2-' y " ;/ - 2 

□ 

Section 4 



> —^-((1 + —^) ' -(! + —-) 2 (1-e- 2 ) 



Lemma |4.2| For any n and Ii, . . . ,I n , there exists a constant C2, C2 > 1, and weights 



Z^_ C2 such that the following inequality holds. 
EpKn, /„)] < E[t! (n, A,..., 7 n )] + E[T x (n - c 2) /(,..., CJ]. 

Proof. Let 7 C F n be the set of all valid game sequences y such that the number of 
players becomes strictly smaller than n for the first time just after y. By definition of 
E[Ti(n, h, . . . , /„)], we have the following equality. 

E[T a (n, /!,..., /„)] = EW'N = E fM^KM + M)- 

Notice here that we can split Pr{yz} in two factors, Pr{y} and Pr y {z}, where Pr y {z} 
determines the probability of z after the game follows y. Also note that the set Y y of 
strongly valid z depends on y. Thus, we can rewrite the above expression as follows. 



E[Ti (n, h,... .Jn)] 

= E E Pr{i/}Pr«W • \y\ + E E ■ 



2 



-2 



£ Pr{y}|y| E Pr s {*} + E Pr M E 
= E[t!(n, 7i, Q] £ Prjz} + £ Pr^E^K, J il(tf) , . . . , J^)] 

< E[tx(n, Ji, . . • , In)] + E[T!(n - c', , C c ,)]- 



16 



where the values of n y and Ii 1 ( y ), ■ ■ ■ , Ii ny (y) are determined by the result of the 
game following y. On the other hand, c' and l[ are chosen so that the value of 
i n yi hi liny (v))] * s maximized. These values always exist since even if there 

is an infinite number of game sequences y that appear on the summation, there is only a 
finite number of possible values for n y (since n y must be between 1 and n — 1) and h^y) 
(since Y.jh^y) = In). □ 



Lemma |4.3| . Let X be a random variable that is 1 with probability 1/n and with 
probability 1 — 1/n, and let S — X\ + . . . + X t , the sum of the outcomes of t random trials 
of X. Then, for some constant a > 0, the following holds for any t and n. 




Proof. We estimate the probability that the statement of the lemma is false and show 
that it is less than 2/3. That is, we upper bound the following probability. 



Pr(S>--aJ- 
n V n 



Pi- IS > - 

n 



Pr(i>S>--aJ- 
n n V n 



The first term in the sum is bounded by 1/2; see, e.g., | JS72 |. Let s be the smallest 
integer such that s > t/n — a^Jt/n. We calculate, by using Stirling's approximation, the 
second term of the above sum as follows. 



Pi' { - > S > - - a 

n n V n 



g (t\ {n - ly-' 



rr 
1)*-* 




Also routine calculations show that ( ~?^E \ j ) ( t(n-\) ) ^ s always less than 1 for s < i < 
t/n and that this factor is maximized when % = t/n. From this by simple calculation, we 
obatain the desired bound with a = Jn/12. 



17 



